# Drivers of nonnative fish invasion patterns differ by species origin and waterbody type

The spreadsheet files (xlsx and csv) contain the data used in the study "Drivers of nonnative fish invasion patterns differ by species origin and waterbody type."

## Description of the data and file structure

1. "fish_occurrence_data.xlsx": fish occurrence data for Yunnan. It contains three columns: species name (species), seventh-level sub-basin ID (HYBAS_ID), and species origin category (catg).

2. "full_alien.xlsx": drivers of occurrence for alien species, in three worksheets: "full_alien", "lotic", and "lentic". The dataset includes the following variables: species name (ex_sp), sub-basin ID (HYBAS_ID), mean functional distance (mean_dis_f), nearest functional distance (min_dis_f), mean phylogenetic distance (mean_dis_p), nearest phylogenetic distance (min_dis_p), precipitation Rao's Q (pre_rao), discharge Rao's Q (dis_rao), temperature Rao's Q (tem_rao), Human Footprint Index (HFP), native species richness (Richness), sub-basin area (area), mean elevation (elevation), waterbody type (type), and species occurrence data (0–1) (occ).

3. "full_translocated.xlsx": drivers of occurrence for translocated species, in three worksheets: "full_translocated", "lotic", and "lentic". The dataset includes the following variables: species name (ts_sp), sub-basin ID (HYBAS_ID), mean functional distance (mean_dis_f), nearest functional distance (min_dis_f), mean phylogenetic distance (mean_dis_p), nearest phylogenetic distance (min_dis_p), precipitation Rao's Q (pre_rao), discharge Rao's Q (dis_rao), temperature Rao's Q (tem_rao), Human Footprint Index (HFP), native species richness (Richness), sub-basin area (area), mean elevation (elevation), waterbody type (type), and species occurrence data (0–1) (occ).

4. "all_contribution.xlsx": the relative contribution of driving factors across the six models, in six worksheets: "alien_contribution_results", "trans_contribution_results", "lotic_a_contribution_results", "lotic_t_contribution_results", "lentic_a_contribution_results", and "lentic_t_contribution_results".

5. "nonnative_funcation_matrix.csv": functional distance similarity matrix for the 94 nonnative fish species.

6. "tree_data.xlsx": the 94 nonnative fish species (species), with their corresponding genera (genus), families (family), and nearest genera (close_genus).

7. "PLOT2.xlsx": data for the phylogenetic and functional trees of the 94 nonnative fish species, with the following columns: species name (species); taxonomic order (order); origin category (catg); occurrence abundance (Abundance); phylogenetic root node (node); mean functional distance to 202 sub-basins (F_Mean); minimum functional distance to 202 sub-basins (F_Min); mean phylogenetic distance to 202 sub-basins (P_Mean); and minimum phylogenetic distance to 202 sub-basins (P_Min).

8. The results of the six structural equation models are provided in six CSV files: "whole_alien", "whole_translocated", "lotic_alien", "lotic_translocated", "lentic_alien", and "lentic_translocated". Each table contains the following columns: response variable name (Response), predictor variable name (Predictor), p-value (P.Value), standardized estimate (Std.Estimate), and significance (Significance).

9. The indirect effect results of the six structural equation models are provided in six CSV files: "whole_alien_indirect", "whole_translocated_indirect", "lotic_alien_indirect", "lotic_translocated_indirect", "lentic_alien_indirect", and "lentic_translocated_indirect". Each table includes the following columns: variable names (from_to_clean), direct effect values (Direct_Effect), and total indirect effect values (total_indirect_effect).
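
The structural equation model tables in items 8 and 9 can be read with Python's standard `csv` module. The sketch below is illustrative only and is not part of the original analysis. It assumes that `P.Value`, `Std.Estimate`, `Direct_Effect`, and `total_indirect_effect` are stored as plain numbers (a cell such as `<0.001` would need extra handling), that 0.05 is an acceptable significance cutoff, and that a total effect can be taken as the direct effect plus the total indirect effect. The function names and the demo rows, including the `HFP -> occ` label format, are hypothetical and made up for the example.

```python
import csv
import io


def read_sem_paths(handle):
    """Parse one SEM result table (item 8) into a list of dicts."""
    rows = []
    for row in csv.DictReader(handle):
        rows.append({
            "response": row["Response"],
            "predictor": row["Predictor"],
            # Assumption: both columns hold plain numeric values.
            "p_value": float(row["P.Value"]),
            "std_estimate": float(row["Std.Estimate"]),
        })
    return rows


def significant_paths(rows, alpha=0.05):
    """Keep paths whose p-value is below alpha (0.05 is an assumed cutoff)."""
    return [r for r in rows if r["p_value"] < alpha]


def read_total_effects(handle):
    """Map each path label (item 9) to direct + total indirect effect."""
    totals = {}
    for row in csv.DictReader(handle):
        direct = float(row["Direct_Effect"])
        indirect = float(row["total_indirect_effect"])
        totals[row["from_to_clean"]] = direct + indirect
    return totals


# Synthetic rows for demonstration only; these are NOT values from the dataset.
demo_paths = io.StringIO(
    "Response,Predictor,P.Value,Std.Estimate,Significance\n"
    "occ,HFP,0.001,0.30,***\n"
    "occ,elevation,0.40,-0.05,\n"
)
demo_indirect = io.StringIO(
    "from_to_clean,Direct_Effect,total_indirect_effect\n"
    "HFP -> occ,0.25,0.5\n"
)

print(significant_paths(read_sem_paths(demo_paths)))
print(read_total_effects(demo_indirect))
```

To use it on the real tables, pass an opened file instead of the in-memory demo text, for example `open("whole_alien.csv", newline="")`; the `.csv` extension is assumed here because the file names above are listed without one.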
